AITopics | low-level feedback

Collaborating Authors

low-level feedback

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

2ff26b12ade4282de80c2461e447c373-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 16:15:19 GMT

machine learning, natural language, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning

Neural Information Processing SystemsDec-24-2025, 19:38:29 GMT

A key source of complexity in next-generation AI models is the size of model outputs, making it time-consuming to parse and provide reliable feedback on. To ensure such models are aligned, we will need to bolster our understanding of scalable oversight and how to scale up human feedback. To this end, we study the challenges of scalable oversight in the context of goal-conditioned hierarchical reinforcement learning. Hierarchical structure is a promising entrypoint into studying how to scale up human feedback, which in this work we assume can only be provided for model outputs below a threshold size. In the cardinal feedback setting, we develop an apt sub-MDP reward and algorithm that allows us to acquire and scale up low-level feedback for learning with sublinear regret. In the ordinal feedback setting, we show the necessity of both high-and low-level feedback, and develop a hierarchical experimental design algorithm that efficiently acquires both types of feedback for learning. Altogether, our work aims to consolidate the foundations of scalable oversight, formalizing and studying the various challenges thereof.

artificial intelligence, machine learning, scalable oversight, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning

Neural Information Processing SystemsOct-9-2025, 22:32:49 GMT

To this end, we study the challenges of scalable oversight in the context of goal-conditioned hierarchical reinforcement learning.

algorithm, arxiv preprint arxiv, low-level feedback, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning

Neural Information Processing SystemsMay-26-2025, 20:28:58 GMT

artificial intelligence, hierarchical reinforcement learning, machine learning, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)

Add feedback

Subgraph Extraction-based Feedback-guided Iterative Scheduling for HLS

Ye, Hanchen, Pan, David Z., Leary, Chris, Chen, Deming, Xu, Xiaoqing

arXiv.org Artificial IntelligenceJan-22-2024

Abstract--This paper proposes ISDC, a novel feedback-guided iterative system of difference constraints (SDC) scheduling algorithm for high-level synthesis (HLS). ISDC leverages subgraph extraction-based low-level feedback from downstream tools like logic synthesizers to iteratively refine HLS scheduling. Technical innovations include: (1) An enhanced SDC formulation that effectively integrates low-level feedback into the linear-programming (LP) problem; (2) A fanout and window-based subgraph extraction mechanism driving the feedback cycle; (3) A no-human-inloop ISDC flow compatible with a wide range of downstream tools and process design kits (PDKs). Evaluation shows that ISDC reduces register usage by 28.5% against an industrial-strength open-source HLS tool. Scheduling is one of the most important problems in highlevel synthesis (HLS) that partitions a computation graph into multiple clock cycles under the given timing and resource Figure 1: Post-synthesis STA vs. XLS-estimated critical path constraints. In 2006, Cong and Zhang [1] proposed a scheduling delay of 6912 different HLS design points.

algorithm, iteration, subgraph, (13 more...)

arXiv.org Artificial Intelligence

2401.12343

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.35)

Add feedback